NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

The Wisdom of Hindsight Makes Language Models Better Instruction Followers

Zhang, Tianjun; Liu, Fangchen; Wong, Justin; Abbeel, Pieter; Gonzalez, Joseph E (July 2023, PMLR)

Reinforcement learning has seen wide success in finetuning large language models to better align with instructions via human feedback. The so-called algorithm, Reinforcement Learning with Human Feedback (RLHF) demonstrates impressive performance on the GPT series models. However, the underlying reinforcement learning algorithm is complex and requires additional training for reward and value networks. In this paper, we consider an alternative approach: converting feedback to instruction by relabeling the original one and training the model for better alignment in a supervised manner. Such an algorithm doesn’t require any additional parameters except for the original language model and maximally reuses the pretraining pipeline. To achieve this, we formulate instruction alignment problem for language models as a goal-reaching problem in decision making. We propose Hindsight Instruction Relabeling (HIR), a novel algorithm for aligning language models with instructions. The resulting two-stage algorithm shed light to a family of reward-free approaches that utilize the hindsightly relabeled instructions based on feedback. We evaluate the performance of HIR extensively on 12 challenging BigBench reasoning tasks and show that HIR outperforms the baseline algorithms and is comparable to or even surpasses supervised fine-tuning. The implementation of HIR is available at https://github.com/tianjunz/HIR.
more » « less
Full Text Available
Masked Autoencoding for Scalable and Generalizable Decision Making

Liu, Fangchen; Liu, Hao; Grover, Aditya; Abbeel, Pieter (January 2023, Advances in neural information processing systems)

Full Text Available
Masked World Models for Visual Control

Seo, Younggyo; Hafner, Danijar; Liu, Hao; Liu, Fangchen; James, Stephen; Lee, Kimin; Abbeel, Pieter (January 2022, Conference on Robot Learning)

Full Text Available
Adversarial Defense by Stratified Convolutional Sparse Coding

Sun, Bo; Tsai, Nian-Hsuan; Liu, Fangchen; Yu, Ronald; Su, Hao (January 2019, CVPR)

Full Text Available
Adversarial Defense by Stratified Convolutional Sparse Coding

Sun, Bo; Tsai, Nian-Hsuan; Liu, Fangchen; Yu, Ronald; Su, Hao (January 2019, Proceedings - IEEE Computer Society Conference on Computer Vision and Pattern Recognition)

Full Text Available

Search for: All records